Figure 1. N-linke 




saccharides for glycosylation 



Peptide sequence (SEQ ID NO: 5):
MPFVNKQFNY KDPVNGVDIA YIKIPNAGQM QPVKAFKIHN KIWVIPERDT FTNPEEGDLN
PPPEAKQVPV SYYDSTYLST DNEKDNYLKG VTKLFERIYS TDLGRMLLTS IVRGIPFWGG
STIDTELKVI DTNCINVIQP DGSYRSEELN LVIIGPSADI IQFECKSFGH EVLNLTRNGY
GSTQYIRFSP DFTFGFEESL EVDTNPLLGA GKFATDPAVT LAHELIHAGH RLYGIAINPN
RVFKVNTNAY YEMSGLEVSF EELRTFGGHD AKFIDSLQEN EFRLYYYNKF KDIASTLNKA
KSIVGTTASL QYMKNVFKEK YLLSEDTSGK FSVDKLKFDK LYKMLTEIYT EDNFVKFFKV
LNRKTYLNFD KAVFKINIVP KVNYTIYDGF NLRNTNLAAN FNGQNTEINN MNFTKLKNFT
GLFEFYKLLC VRGIITSKTK SLDKGYNKAL NDLCIKVNNW DLFFSPSEDN FTNDLNKGEE
ITSDTNIEAA EENISLDLIQ QYYLTFNFDN EPENISIENL SSDIIGQLEL MPNIERFPNG
KKYELDKYTM FHYLRAQEFE HGKSRIALTN SVNEALLNPS RVYTFFSSDY VKKVNKATEA
AMFLGWVEQL VYDFTDETSE VSTTDKIADI TIIIPYIGPA LNIGNMLYKD DFVGALIFSG
AVILLEFIPE IAIPVLGTFA LVSYIANKVL TVQTIDNALS KRNEKWDEVY KYIVTNWLAK
VNTQIDLIRK KMKEALENQA EATKAIINYQ YNQYTEEEKN NINFNIDDLS SKLNESINKA
MININKFLNQ CSVSYLMNSM IPYGVKRLED FDASLKDALL KYIYDNRGTL IGQVDRLKDK
VNNTLSTDIP FQLSKYVDNQ RLLSTFTEYI KNIINTSILN LRYESNHLID LSRYASKINI
GSKVNFDPID KNQIQLFNLE SSKIEVILKN AIVYNSMYEN FSTSFWIRIP KYFNSISLNN
EYTIINCMEN NSGWKVSLNY GEIIWTLQDT QEIKQRVVFK YSQMINISDY INRWIFVTIT
NNRLNNSKIY INGRLIDQKP ISNLGNIHAS NNIMFKLDGC RDTHRYIWIK YFNLFDKELN
EKEIKDLYDN QSNSGILKDF WGDYLQYDKP YYMLNLYDPN KYVDVNNVGI RGYMYLKGPR
GSVMTTNIYL NSSLYRGTKF IIKKYASGNK DNIVRNNDRV YINVVVKNKE YRLATNASQA
GVEKILSALE IPDVGNLSQV VVMKSKNDQG ITNKCKMNLQ DNNGNDIGFI GFHQFNNIAK
LVASNWYNRQ IERSSRTLGC SWEFIPVDDG WGERPL



Peptides containing the motif 'N-X-S/T/C (X not P)' (underlined):

position   peptide  SEQ ID NO:
167-177    SFGHEVLNLTR  6
382-393    VNYTIYDGFNLR  7
394-415    NTNLAANFNGQNTEINNMNFTK  8
418-427    NFTGLFEFYK  9
457-477    VNNWDLFFSPSEDNFTNDLNK  10
478-536    GEEITSDTNIEAAEENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIER  11
773-779    LNESINK  12
787-806    FLNQCSVSYLMNSMIPYGVK  13
841-855    VNNTLSTDIPFQLSK  14
872-882    NIINTSILNLR  15
930-948    NAIVYNSMYENFSTSFWIR  16
952-975    YFNSISLNNEYTIINCMENNSGWK  17
1001-1013  YSQMINISDYINR  18
1024-1028  LNNSK  19
1086-1098  DLYDNQSNSGILK  20
1141-1156  GSVMTTNIYLNSSLYR  21
1193-1204  LATNASQAGVEK  22
1205-1224  ILSALEIPDVGNLSQVVVMK  23
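The motif used to select the peptides above, N-X-S/T/C with X not P, can be checked mechanically. The following is a minimal sketch, not the method used to prepare the figure: the function name `find_sequons` and the `start` parameter are hypothetical, and `start` is simply the residue number of the first character so that reported positions line up with the position column.

```python
import re

# N, then any residue except P, then S, T or C.
# A lookahead is used so that overlapping motifs are all reported.
SEQUON = re.compile(r"(?=N[^P][STC])")

def find_sequons(seq, start=1):
    """Return (position of N, tripeptide) for each N-X-S/T/C motif, X != P.

    `start` is the residue number of seq[0] (hypothetical helper argument).
    """
    seq = seq.upper()
    return [(m.start() + start, seq[m.start():m.start() + 3])
            for m in SEQUON.finditer(seq)]

# Peptide listed at positions 167-177 in the table above.
print(find_sequons("SFGHEVLNLTR", start=167))  # [(174, 'NLT')]
```

A peptide such as LNPSR would return an empty list, because the residue after N is P.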



Figure 1 (continued).



Peptide sequence (SEQ ID NO: 39):
KTKSLDKGYN KALNDLCIKV NNWDLFFSPS EDNFTNDLNK GEEITSDTNI EAAEENISLD
LIQQYYLTFN FDNEPENISI ENLSSDIIGQ LELMPNIERF PNGKKYELDK YTMFHYLRAQ
EFEHGKSRIA LTNSVNEALL NPSRVYTFFS SDYVKKVNKA TEAAMFLGWV EQLVYDFTDE
TSEVSTTDKI ADITIIIPYI GPALNIGNML YKDDFVGALI FSGAVILLEF IPEIAIPVLG
TFALVSYIAN KVLTVQTIDN ALSKRNEKWD EVYKYIVTNW LAKVNTQIDL IRKKMKEALE
NQAEATKAII NYQYNQYTEE EKNNINFNID DLSSKLNESI NKAMININKF LNQCSVSYLM
NSMIPYGVKR LEDFDASLKD ALLKYIYDNR GTLIGQVDRL KDKVNNTLST DIPFQLSKYV
DNQRLLSTFT EYIKNIINTS ILNLRYESNH LIDLSRYASK INIGSKVNFD PIDKNQIQLF
NLESSKIEVI LKNAIVYNSM YENFSTSFWI RIPKYFNSIS LNNEYTIINC MENNSGWKVS
LNYGEIIWTL QDTQEIKQRV VFKYSQMINI SDYINRWIFV TITNNRLNNS KIYINGRLID
QKPISNLGNI HASNNIMFKL DGCRDTHRYI WIKYFNLFDK ELNEKEIKDL YDNQSNSGIL
KDFWGDYLQY DKPYYMLNLY DPNKYVDVNN VGIRGYMYLK GPRGSVMTTN IYLNSSLYRG
TKFIIKKYAS GNKDNIVRNN DRVYINVVVK NKEYRLATNA SQAGVEKILS ALEIPDVGNL
SQVVVMKSKN DQGITNKCKM NLQDNNGNDI GFIGFHQFNN IAKLVASNWY NRQIERSSRT
LGCSWEFIPV DDGWGERPL



Peptides containing the motif 'N-X-S/T/C (X not P)':

position   peptide  SEQ ID NO:
20-40      VNNWDLFFSPSEDNFTNDLNK  25
41-99      GEEITSDTNIEAAEENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIER  26
336-342    LNESINK  27
350-369    FLNQCSVSYLMNSMIPYGVK  28
404-418    VNNTLSTDIPFQLSK  29
435-445    NIINTSILNLR  30
493-511    NAIVYNSMYENFSTSFWIR  31
515-538    YFNSISLNNEYTIINCMENNSGWK  32
564-576    YSQMINISDYINR  33
587-591    LNNSK  34
649-661    DLYDNQSNSGILK  35
704-719    GSVMTTNIYLNSSLYR  36
756-767    LATNASQAGVEK  37
768-787    ILSALEIPDVGNLSQVVVMK  38
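The positions in this table differ from those given for the same peptides in the first part of Figure 1 by a constant (for example 457-477 versus 20-40, and 773-779 versus 336-342). The sketch below converts between the two numberings. The offset of 437 is inferred from those table entries and is not stated in the figure, and the names `FRAGMENT_OFFSET`, `to_fragment` and `to_full_length` are hypothetical.

```python
# Offset inferred from the tables (457 - 20 = 773 - 336 = 437); an assumption,
# not a value stated in the figure.
FRAGMENT_OFFSET = 437

def to_fragment(first, last, offset=FRAGMENT_OFFSET):
    """Map full-length positions to positions in the shorter sequence."""
    return first - offset, last - offset

def to_full_length(first, last, offset=FRAGMENT_OFFSET):
    """Map positions in the shorter sequence back to full-length positions."""
    return first + offset, last + offset

print(to_fragment(1205, 1224))  # (768, 787)
```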



Figure 2. 

Peptide sequence (SEQ ID NO: 39):
MPFVNKQFNY KDPVNGVDIA YIKIPNAGQM QPVKAFKIHN KIWVIPERDT FTNPEEGDLN
PPPEAKQVPV SYYDSTYLST DNEKDNYLKG VTKLFERIYS TDLGRMLLTS IVRGIPFWGG
STIDTELKVI DTNCINVIQP DGSYRSEELN LVIIGPSADI IQFECKSFGH EVLNLTRNGY
GSTQYIRFSP DFTFGFEESL EVDTNPLLGA GKFATDPAVT LAHELIHAGH RLYGIAINPN
RVFKVNTNAY YEMSGLEVSF EELRTFGGHD AKFIDSLQEN EFRLYYYNKF KDIASTLNKA
KSIVGTTASL QYMKNVFKEK YLLSEDTSGK FSVDKLKFDK LYKMLTEIYT EDNFVKFFKV
LNRKTYLNFD KAVFKINIVP KVNYTIYDGF NLRNTNLAAN FNGQNTEINN MNFTKLKNFT
GLFEFYKLLC VRGIITSKTK SLDKGYNKAL NDLCIKVNNW DLFFSPSEDN FTNDLNKGEE
ITSDTNIEAA EENISLDLIQ QYYLTFNFDN EPENISIENL SSDIIGQLEL MPNIERFPNG
KKYELDKYTM FHYLRAQEFE HGKSRIALTN SVNEALLNPS RVYTFFSSDY VKKVNKATEA
AMFLGWVEQL VYDFTDETSE VSTTDKIADI TIIIPYIGPA LNIGNMLYKD DFVGALIFSG
AVILLEFIPE IAIPVLGTFA LVSYIANKVL TVQTIDNALS KRNEKWDEVY KYIVTNWLAK
VNTQIDLIRK KMKEALENQA EATKAIINYQ YNQYTEEEKN NINFNIDDLS SKLNESINKA
MININKFLNQ CSVSYLMNSM IPYGVKRLED FDASLKDALL KYIYDNRGTL IGQVDRLKDK
VNNTLSTDIP FQLSKYVDNQ RLLSTFTEYI KNIINTSILN LRYESNHLID LSRYASKINI
GSKVNFDPID KNQIQLFNLE SSKIEVILKN AIVYNSMYEN FSTSFWIRIP KYFNSISLNN
EYTIINCMEN NSGWKVSLNY GEIIWTLQDT QEIKQRVVFK YSQMINISDY INRWIFVTIT
NNRLNNSKIY INGRLIDQKP ISNLGNIHAS NNIMFKLDGC RDTHRYIWIK YFNLFDKELN
EKEIKDLYDN QSNSGILKDF WGDYLQYDKP YYMLNLYDPN KYVDVNNVGI RGYMYLKGPR
GSVMTTNIYL NSSLYRGTKF IIKKYASGNK DNIVRNNDRV YINVVVKNKE YRLATNASQA
GVEKILSALE IPDVGNLSQV VVMKSKNDQG ITNKCKMNLQ DNNGNDIGFI GFHQFNNIAK
LVASNWYNRQ IERSSRTLGC SWEFIPVDDG WGERPL



Peptides containing S or T (underlined):

position   peptide  SEQ ID NO:
49-66      DTFTNPEEGDLNPPPEAK  40
67-84      QVPVSYYDSTYLSTDNEK  41
90-93      GVTK  42
98-105     IYSTDLGR  43
106-113    MLLTSIVR  44
114-128    GIPFWGGSTIDTELK  45
129-145    VIDTNCINVIQPDGSYR  46
146-166    SEELNLVIIGPSADIIQFECK  47
167-177    SFGHEVLNLTR  48
178-187    NGYGSTQYIR  49
188-212    FSPDFTFGFEESLEVDTNPLLGAGK  50
213-231    FATDPAVTLAHELIHAGHR  51
245-264    VNTNAYYEMSGLEVSFEELR  52
265-272    TFGGHDAK  53
273-283    FIDSLQENEFR  54
292-299    DIASTLNK  55
302-314    SIVGTTASLQYMK  56
321-330    YLLSEDTSGK  57
331-335    FSVDK  58
344-356    MLTEIYTEDNFVK  59
365-371    TYLNFDK  60
382-393    VNYTIYDGFNLR  61
394-415    NTNLAANFNGQNTEINNMNFTK  62
418-427    NFTGLFEFYK  63
433-438    GIITSK  64
439-440    TK
441-444    SLDK  66
457-477    VNNWDLFFSPSEDNFTNDLNK  67
478-536    GEEITSDTNIEAAEENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIER  68
548-555    YTMFHYLR  69
566-581    IALTNSVNEALLNPSR  71
582-592    VYTFFSSDYVK  72
597-626    ATEAAMFLGWVEQLVYDFTDETSEVSTTDK  73
627-649    IADITIIIPYIGPALNIGNMLYK  74
650-688    DDFVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANK  75
689-701    VLTVQTIDNALSK  76
712-720    YIVTNWLAK  77
721-729    VNTQIDLIR  78
734-744    EALENQAEATK  79
745-759    AIINYQYNQYTEEEK  80
760-772    NNINFNIDDLSSK  81
773-779    LNESINK  82
787-806    FLNQCSVSYLMNSMIPYGVK  83
808-816    LEDFDASLK  84
828-836    GTLIGQVDR  85
841-855    VNNTLSTDIPFQLSK  86
862-871    LLSTFTEYIK  87
872-882    NIINTSILNLR  88
883-893    YESNHLIDLSR  89
894-897    YASK  90
898-903    INIGSK  91
912-923    NQIQLFNLESSK  92
930-948    NAIVYNSMYENFSTSFWIR  93
952-975    YFNSISLNNEYTIINCMENNSGWK  94
976-994    VSLNYGEIIWTLQDTQEIK  95
1001-1013  YSQMINISDYINR  96
1014-1023  WIFVTITNNR  97
1024-1028  LNNSK  98
1035-1056  LIDQKPISNLGNIHASNNIMFK  99
1062-1065  DTHR  100
1086-1098  DLYDNQSNSGILK  101
1141-1156  GSVMTTNIYLNSSLYR  102
1157-1159  GTK
1165-1170  YASGNK  104
1193-1204  LATNASQAGVEK  103


1205-1224  ILSALEIPDVGNLSQVVVMK  70
1225-1226  SK
1227-1234  NDQGITNK  65
1261-1269  LVASNWYNR  24
1274-1276  SSR
1277-1296  TLGCSWEFIPVDDGWGERPL  105
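The peptide boundaries listed in Figure 2 are consistent with cleavage after K or R, keeping only fragments that contain S or T. The figure does not name the cleavage rule, so the following is a sketch under that assumption (cleavage after K/R, but not before P), not a description of how the table was produced. The function names `tryptic_peptides` and `with_ser_thr` are hypothetical.

```python
import re

def tryptic_peptides(seq, start=1):
    """Yield (first, last, peptide), cutting after K or R unless followed by P.

    Assumed rule; `start` is the residue number of seq[0].
    """
    pos = start
    # Zero-width split points: preceded by K/R and not followed by P.
    for pep in re.split(r"(?<=[KR])(?!P)", seq):
        if pep:  # skip the empty piece produced when seq ends in K/R
            yield pos, pos + len(pep) - 1, pep
            pos += len(pep)

def with_ser_thr(seq, start=1):
    """Keep only the peptides that contain at least one S or T."""
    return [(a, b, p) for a, b, p in tryptic_peptides(seq, start)
            if "S" in p or "T" in p]

# Residues 49-93 of the sequence shown in Figure 2.
segment = "DTFTNPEEGDLNPPPEAKQVPVSYYDSTYLSTDNEKDNYLKGVTK"
for first, last, pep in with_ser_thr(segment, start=49):
    print(f"{first}-{last}", pep)
# 49-66 DTFTNPEEGDLNPPPEAK
# 67-84 QVPVSYYDSTYLSTDNEK
# 90-93 GVTK
```

The fragment DNYLK (85-89) is produced by the split but dropped by the filter because it contains neither S nor T, which matches its absence from the table.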



